Information Technology "Key to Text" for Semantic Search and Indexing of Textual Information - An Essential Tool for Electronic Publishing
Abstract
Introduction
Electronic editions give essentially new capabilities for structuring and organizing information search, both for the reader and for information service providers. Before the computer revolution, any edition on a library shelf, or under a veil of dust on a desk, meant no more than what was written on its catalogue card until a reader took it in his hands. (Certainly, we do not speak here of editions surrounded by the light of legend.) Only an electronic edition is able to speak at the top of its voice even in the absence of a reader. The complete word index of all accessible editions, which 30 years ago was the dream of every visitor to a scientific library, has today become a curse. Let us imagine a reader who wants to find verses about love (about real love). He will receive a vast list of references to 10, 20, 30, … ways of love, 1001 nights of love, the legal, psychological, and physiological features of love among sexual minorities, love for the Fatherland, and dislike of certain characters. But he was searching for something else! His wishes and ideas aspired to something different. He simply formulated a search query, and the search results only hide his idea of love behind a detailed lexical map of the uses of the word "love". Fortunately, it is possible to direct the ability of electronic editions to speak (recall Akhmatova's verse: «I have taught women to speak, but God, who will force them to stop?») into a channel of intelligent, purposeful dialogue with the prospective reader. In this paper we discuss the technology that ensures such a dialogue on the basis of automated semantic search and analysis of textual information. This dialogue is important not only for the reader, who hungers for the information he wants. It is extremely important for the author or publisher too, because they need a reliable prediction of how the published text is understood by different categories of readers. N.B.! The Appendix gives the results of analyzing the text of this paper by the proposed methods: a set of keywords as they are presented to a reader of the newspaper "Times". The fragments of the text included in the computed summary are underlined in the text of the paper. …
Related resources
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing the semantic concepts of text motivated researchers to use...
EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatic text summarization has become an essential tool for web users. In this paper we present a novel approach for creating text summaries. Using fuzzy logic and WordNet, our model extracts the most relevant sentences from an original document. The approach utilizes fuzzy measures and inference on the extracted textual information from the docu...
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
Investigating the application of semantic technology for organizing information in digital library software
The present study was an attempt to investigate the use of semantic technologies to organize information in digital library software systems. The present study was a practical one which employed a descriptive survey method. The study sample consisted of three digital library software systems entitled Pars Azarakhsh, Parvan Pajoh, and Payam Mashregh. Data were collected through a checklist incl...
A review of the most reputable medical microbiology journals, Spring 1393
Background and Objective: Publishing articles in prestigious, internationally indexed and distributed specialized journals lets scientists validate the results of their research. It may also raise the standing of the scientist and of the country in which the research was done. Considering the importance of this issue, the present study aimed to i...